Syllables, Morphemes and Bayesian Computational Models of Acquiring a Word Grammar
نویسندگان
چکیده
We report a computational study on the CHILDES database for learning a word grammar of Turkish nouns. The syllable-based model converges to a morpheme-based model in terms of overlaps in the set of lexical hypotheses. Morphology is a hidden variable in all models, and the search problem for hypotheses is narrowed down by a probabilistic conception of universal grammar à la Combinatory Categorial Grammar. The convergence of the syllable model suggests that morphemehood can be an emergent computational property.
منابع مشابه
تکیه در زبان فارسی
Abstract: This research has been carried out in the framework of Auto segmental-metrical (AM) phonology to study the stress in Persian. Two types of abstract and concrete prominences were distinguished in which the first one refers to the stress and the second one refers to the pitch accent. Stress is assumed to be a lexical property of the lexemes, but pitch accent is assumed to be an intonati...
متن کاملThe changing status of 'filler syllables' on the way to grammatical morphemes.
The appearance of 'filler syllables' (called here PAEs, for Prefixed Additional Elements) in the late single-word period is analysed in relation to the emergence of grammatical morphemes, by confronting data from the longitudinal study of one child acquiring French, video-recorded between 1;3.2 and 2;2.6, with four hypotheses making different claims about the kind of language knowledge underlyi...
متن کاملAccuracy Order of Grammatical Morphemes in Persian EFL Learners: Evidence for and against UG
This study addresses the acquisition of the morphological markers in Persian learners of English as a foreign language. To this end, the accuracy order of nine morphemes including plural –s, progressive –ing, copula be, auxiliary be, irregular past tense, regular past tense –ed, third person –s, possessive -ʼs and indefinite articles was studied in 6...
متن کاملProbabilistic modelling of morphologically rich languages
This thesis investigates how the sub-structure of words can be accounted for in probabilistic models of language. Such models play an important role in natural language processing tasks such as translation or speech recognition, but often rely on the simplistic assumption that words are opaque symbols. This assumption does not fit morphologically complex language well, where words can have rich...
متن کاملSyllable weight encodes mostly the same information for English word segmentation as dictionary stress
Stress is a useful cue for English word segmentation. A wide range of computational models have found that stress cues enable a 2-10% improvement in segmentation accuracy, depending on the kind of model, by using input that has been annotated with stress using a pronouncing dictionary. However, stress is neither invariably produced nor unambiguously identifiable in real speech. Heavy syllables,...
متن کامل